Distribution-Based Query Scheduling

Authors

  • Yun Chi
  • Hakan Hacigümüs
  • Wang-Pin Hsiung
  • Jeffrey F. Naughton
Abstract

Query scheduling, a fundamental problem in database management systems, has recently received renewed attention, perhaps in part due to the rise of the “database as a service” (DaaS) model for database deployment. While there has been a great deal of work investigating different scheduling algorithms, there has been comparatively little work investigating what the scheduling algorithms can or should know about the queries to be scheduled. In this work, we investigate the efficacy of using histograms describing the distribution of likely query execution times as input to the query scheduler. We propose a novel distribution-based scheduling algorithm, Shepherd, and show through extensive experimentation with both synthetic and TPC workloads that Shepherd substantially outperforms state-of-the-art point-based methods.
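
The difference between a point estimate and a histogram of execution times can be illustrated with a toy example. The sketch below is not the Shepherd algorithm, whose details are not given in the abstract. It assumes a hypothetical histogram format (a list of (time, probability) pairs), a hypothetical cost model (the expected number of missed deadlines), independent execution times, and a brute-force search over orderings that is only practical for a handful of queries.

```python
from itertools import permutations

# Hypothetical workload: name -> (histogram, deadline).
# A histogram is a list of (execution_time, probability) pairs.
QUERIES = {
    "A": ([(4.0, 1.0)], 5.0),               # always 4 s, tight deadline
    "B": ([(1.0, 0.9), (21.0, 0.1)], 30.0),  # usually 1 s, rarely 21 s
}

def mean(hist):
    """Point estimate: the expected execution time."""
    return sum(t * p for t, p in hist)

def convolve(a, b):
    """Distribution of the sum of two independent execution times."""
    out = {}
    for ta, pa in a:
        for tb, pb in b:
            out[ta + tb] = out.get(ta + tb, 0.0) + pa * pb
    return sorted(out.items())

def expected_misses(order, queries):
    """Expected number of missed deadlines when `order` runs back to back."""
    finish = [(0.0, 1.0)]  # distribution of the current finish time
    total = 0.0
    for name in order:
        hist, deadline = queries[name]
        finish = convolve(finish, hist)
        total += sum(p for t, p in finish if t > deadline)
    return total

def point_based_order(queries):
    """Shortest expected time first, using only the mean of each histogram."""
    return sorted(queries, key=lambda name: mean(queries[name][0]))

def distribution_based_order(queries):
    """Ordering with the fewest expected misses, using the full histograms."""
    return min(permutations(queries), key=lambda o: expected_misses(o, queries))

if __name__ == "__main__":
    for label, order in [("point-based", point_based_order(QUERIES)),
                         ("distribution-based", distribution_based_order(QUERIES))]:
        print(label, list(order), expected_misses(order, QUERIES))
```

In this toy workload the mean of B (3 s) is smaller than that of A (4 s), so the point-based rule runs B first and A misses its deadline whenever B takes 21 s. The histogram exposes that tail, so the distribution-based rule runs A first and neither query is late.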


Similar resources

EM-KDE: A locality-aware job scheduling policy with distributed semantic caches

In modern query processing systems, the caching facilities are distributed and scale with the number of servers. To maximize the overall system throughput, the distributed system should balance the query loads among servers and also leverage cached results. In particular, leveraging distributed cached data is becoming more important as many systems are being built by connecting many small heter...

A new approach in graph-based integrated production and distribution scheduling for perishable products

This study is concerned with how the quality of perishable products can be improved by shortening the time interval between production and distribution. As special types of food such as dairy products decay fast, the integration of production and distribution scheduling (IPDS) is investigated. An integrated scheduling of both processes improves the performance and costs because the separated sc...

Scheduling Post-Distribution Cross-Dock under Demand Uncertainty

The system of distribution of goods and services, along with other economic developments around the world, is rapidly evolving. In the world of distribution of goods, the main focus is on making distribution operations more effective. Due to the fact that the cross-dock has the advantage of removing intermediaries and reducing the space required for the warehouse, it is worth considering. Among...

DEMB: Cache-Aware Scheduling for Distributed Query Processing

Leveraging data in distributed caches for large scale query processing applications is becoming more important, given current trends toward building large scalable distributed systems by connecting multiple heterogeneous less powerful machines rather than purchasing expensive homogeneous and very powerful machines. As more servers are added to such clusters, more memory is available for caching...

Optimization of a Bi-objective Scheduling for Two Groups of Experienced and Inexperienced Distribution Staff Based on Capillary Marketing

Developing an appropriate plan for the distribution department is significant because of its influence on the company's other costs and customers' satisfaction. In this study, a new bi-objective mixed-integer linear programming model is developed for scheduling two groups of experienced and inexperienced distribution staff based on capillary marketing in Pak Pasteurized Dairy Products Company of Guilan provi...

Journal:
  • PVLDB

Volume 6

Publication date: 2013